In this paper we study an abelian version of the notion of return word. Our main result is a new 
characterization of Sturmian words via abelian returns. Namely, we prove that a word is Sturmian if 
and only if each of its factors has two or three abelian returns. In addition, we describe the structure of 
abelian returns in Sturmian words, and discuss connections between abelian returns and periodicity. 

1 Introduction 

Sturmian words can be defined as infinite words having the lowest subword complexity among all 
aperiodic words. Sturmian words have been widely studied due to their fundamental importance in 
different fields of theoretical computer science, and surveys of results on Sturmian words are 
available in the literature. Sturmian words have many equivalent characterizations, e.g., using 
balanced words, cutting sequences, mechanical words, and morphisms. In this paper, we develop the 
approach based on the concept of return words. 

The notion of a return word is a powerful tool for studying various problems of combinatorics on 
words, symbolic dynamical systems and number theory. Considering each occurrence of a factor v in an 
infinite word, the set of return words of v is defined to be the set of all distinct words beginning with an 
occurrence of v and ending just before the next occurrence of v. This notion was introduced by F. Durand 
and was used for a characterization of primitive substitutive sequences. In [6] it was proved that a 
word is Sturmian if and only if each of its factors has two returns; in [3] the proofs were simplified and 
return words were studied in episturmian words. 

In this paper, we establish a similar result for an abelian analogue of the notion of return word. Two 
words are abelian equivalent, if they are permutations of each other. Different abelian properties of words 
are widely studied nowadays, such as abelian powers, avoidance, complexity, abelian periods, etc. We 
consider return words up to abelian equivalence: defining abelian returns of a factor v of an infinite word, 
we consider all occurrences of factors abelian equivalent to v, and the set of abelian returns is also defined 
up to abelian equivalence. As the main result we prove that a word is Sturmian if and only if each of its 
factors has two or three abelian returns. Notice that the methods we use differ from those used for 
ordinary return words. 

The paper is organized as follows. After a few preliminary definitions in Section 2, we discuss in 
Section 3 connections between abelian returns and periodicity. In Section 4, we state our main result 
concerning characterization of Sturmian words. In Section 5 we study the structure of abelian returns of 
Sturmian words. We prove that every factor of a Sturmian word has two or three abelian returns; more- 
over, a factor has two abelian returns if and only if it is singular. In Section 6 we prove the sufficiency of 
the condition on the number of abelian returns for a word to be Sturmian. 

2 Preliminaries 

We begin by presenting some basics on return words together with key definitions we use in the paper. 

Given a finite non-empty set $\Sigma$ (called the alphabet), we denote by $\Sigma^*$ and $\Sigma^{\omega}$, respectively, 
the set of finite words and the set of (right) infinite words over the alphabet $\Sigma$. A word $v$ is a factor 
(resp. a prefix, resp. a suffix) of a word $w$ if there exist words $x, y$ such that $w = xvy$ (resp. $w = vy$, 
resp. $w = xv$). The set of factors of a finite or infinite word $w$ is denoted by $F(w)$. Given a finite word 
$u = u_1 u_2 \cdots u_n$ with $n \ge 1$ and $u_i \in \Sigma$, we denote the length $n$ of $u$ by $|u|$. The empty word will be 
denoted by $\varepsilon$ and we set $|\varepsilon| = 0$. We say that a word $w$ is periodic if there exists $T$ such that 
$w_{n+T} = w_n$ for every $n$. A word $w$ is aperiodic if it is not periodic. 

Sturmian words can be defined in many different ways. For example, they are infinite words having 
the smallest subword complexity among aperiodic words. The subword complexity of a word is the 
function $f(n)$ defined as the number of its factors of length $n$. For Sturmian words $f(n) = n + 1$. 
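
As a small sanity check of the complexity function, the following sketch counts factors in a finite prefix of the Fibonacci word, a standard example of a Sturmian word. The choice of the Fibonacci word, the prefix length, and the helper names `fibonacci_word` and `factors` are illustrative assumptions, not part of the text above. A finite prefix can only undercount factors, so the prefix is chosen generously.

```python
def fibonacci_word(n):
    """Prefix of length n of the Fibonacci word (fixed point of 0 -> 01, 1 -> 0)."""
    w = "0"
    while len(w) < n:
        w = "".join("01" if c == "0" else "0" for c in w)
    return w[:n]


def factors(w, n):
    """Set of distinct factors of length n occurring in the finite word w."""
    return {w[i:i + n] for i in range(len(w) - n + 1)}


w = fibonacci_word(2000)
# Number of distinct factors of length n, for n = 1, ..., 10.
complexity = [len(factors(w, n)) for n in range(1, 11)]
print(complexity)
```

For this prefix the list reads 2, 3, ..., 11, in agreement with $f(n) = n + 1$.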

Let $w = w_1 w_2 \cdots$ be an infinite word. The word $w$ is recurrent if each of its factors occurs 
infinitely many times in $w$. In this case, for $u \in F(w)$, let $n_1 < n_2 < \cdots$ be all integers such that 
$u = w_{n_i} \cdots w_{n_i+|u|-1}$. Then the word $w_{n_i} \cdots w_{n_{i+1}-1}$ is a return word (or briefly return) of $u$ in $w$. An 
infinite word has $k$ returns if each of its factors has $k$ returns. The following characterization of 
Sturmian words via return words was established in [6]: 

Theorem 1. [6] A recurrent infinite word has two returns if and only if it is Sturmian. 
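
The definition of return words can be tried out on finite prefixes. The sketch below is only an illustration under stated assumptions: it uses the Fibonacci word as a sample Sturmian word and the Thue-Morse word as a sample recurrent non-Sturmian word, and the helper names are hypothetical. Returns found in a prefix are genuine returns of the infinite word, but a short prefix may miss some.

```python
def fibonacci_word(n):
    """Prefix of length n of the Fibonacci word (fixed point of 0 -> 01, 1 -> 0)."""
    w = "0"
    while len(w) < n:
        w = "".join("01" if c == "0" else "0" for c in w)
    return w[:n]


def thue_morse(n):
    """Prefix of length n of the Thue-Morse word (parity of the binary digit sum)."""
    return "".join(str(bin(i).count("1") % 2) for i in range(n))


def factors(w, n):
    """Set of distinct factors of length n occurring in the finite word w."""
    return {w[i:i + n] for i in range(len(w) - n + 1)}


def return_words(w, u):
    """Words starting at an occurrence of u and ending just before the next one."""
    occ = [i for i in range(len(w) - len(u) + 1) if w.startswith(u, i)]
    return {w[a:b] for a, b in zip(occ, occ[1:])}


fib = fibonacci_word(3000)
# Number of return words for every factor of length 1..6 seen in the prefix.
counts = {u: len(return_words(fib, u)) for n in range(1, 7) for u in factors(fib, n)}
# In the Thue-Morse word the letter 0 already has three returns.
tm_returns = return_words(thue_morse(256), "0")
print(set(counts.values()), sorted(tm_returns))
```

Every short factor of the Fibonacci prefix has two returns, while the letter 0 in the Thue-Morse word has the three returns 0, 01 and 011.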

Also there exists a simple characterization of periodicity via return words: 

Proposition 1. [6] A recurrent infinite word is ultimately periodic if and only if there exists a factor 
having exactly one return word. 

We now define the basic notions for the abelian case. Given a finite word $u = u_1 u_2 \cdots u_n$ with $n \ge 1$ 
and $u_i \in \Sigma$, for each $a \in \Sigma$ we let $|u|_a$ denote the number of occurrences of the letter $a$ in $u$. Two words 
$u$ and $v$ in $\Sigma^*$ are abelian equivalent if and only if $|u|_a = |v|_a$ for all $a \in \Sigma$. We denote it by $u \approx_{ab} v$. It is 
easy to see that abelian equivalence is indeed an equivalence relation on $\Sigma^*$. 

For an infinite recurrent word $w$ and for $u \in F(w)$, let $n_1 < n_2 < \cdots$ be all integers such that 
$w_{n_i} \cdots w_{n_i+|u|-1} \approx_{ab} u$. Then the word $w_{n_i} \cdots w_{n_{i+1}-1}$ is an abelian return word (or briefly abelian return) 
of $u$ in $w$. We say that $u$ has $k$ abelian returns if the set of its abelian returns consists of $k$ abelian classes. 
So, we actually consider abelian classes of returns to abelian classes. 

Example. Consider abelian returns of the factor 01 of the Thue-Morse word 

$$t = 0110100110010110\ldots$$

that is a fixed point of the morphism $\mu$: $\mu(0) = 01$, $\mu(1) = 10$. The abelian class of 01 consists of 
two words 01 and 10. Consider an occurrence of 01 starting at position $i$, i.e., $t_i = 0$, $t_{i+1} = 1$. It can 
be followed by either 0 or 10, i.e., we have either $t_{i+2} = 0$ or $t_{i+2} = 1$, $t_{i+3} = 0$. In the first case we 
have $t_{i+1}t_{i+2} = 10$, which is abelian equivalent to 01, and hence we have an abelian return $t_i = 0$. In 
the second case $t_{i+1}t_{i+2} = 11$, which is not abelian equivalent to 01, so we consider the next factor 
$t_{i+2}t_{i+3} = 10 \approx_{ab} 01$, which gives the abelian return $t_i t_{i+1} = 01$. Symmetrically, 10 gives abelian returns 
1 and 10. So, in total the abelian class of 01 has three abelian returns: 0, 1 and $01 \approx_{ab} 10$. 
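
The example can be reproduced mechanically. The following is a minimal sketch, assuming a finite prefix of the Thue-Morse word is long enough to exhibit every abelian return of 01; the function names `parikh` and `abelian_returns` are illustrative, not notation from the paper.

```python
def thue_morse(n):
    """Prefix of length n of the Thue-Morse word (parity of the binary digit sum)."""
    return "".join(str(bin(i).count("1") % 2) for i in range(n))


def parikh(u):
    """Letter counts (number of 0s, number of 1s); equal vectors mean abelian equivalence."""
    return (u.count("0"), u.count("1"))


def abelian_returns(w, u):
    """Map each abelian class of returns (a Parikh vector) to the return words seen in w."""
    n, target = len(u), parikh(u)
    # Positions of all factors abelian equivalent to u.
    occ = [i for i in range(len(w) - n + 1) if parikh(w[i:i + n]) == target]
    classes = {}
    for a, b in zip(occ, occ[1:]):
        classes.setdefault(parikh(w[a:b]), set()).add(w[a:b])
    return classes


t = thue_morse(64)
classes = abelian_returns(t, "01")
for vector, words in sorted(classes.items()):
    print(vector, sorted(words))
```

The three keys correspond to the abelian returns 0, 1 and the class containing both 01 and 10.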

In this paper we establish a new characterization of Sturmian words analogous to Theorem 1. Namely, 
we prove that a recurrent infinite word is Sturmian if and only if each of its factors has two or three abelian 
returns. On the other hand, contrary to the property of being Sturmian, abelian returns do not give a simple 
characterization of periodicity analogous to Proposition 1. 

3 Abelian returns and periodicity 

First we prove a simple sufficient condition for periodicity: 

Lemma 1. Let $|\Sigma| = k$. If each factor of a recurrent infinite word over the alphabet $\Sigma$ has at most $k$ 
abelian returns, then the word is periodic. 

Proof. Let $w$ be a recurrent word over a $k$-letter alphabet, and let $v$ be a factor of $w$ containing all letters 
of the alphabet. Consider two occurrences of $v$ in $w$, say at positions $m$ and $n$ (with $m < n$). Then the 
abelian class of $w_m \cdots w_{n-1}$ has all letters as abelian returns, and hence no others, because every factor of 
$w$ has at most $k$ abelian returns. Since all returns are single letters, any two consecutive factors of 
length $n - m$ are abelian equivalent, which forces $w_{i+n-m} = w_i$ for all $i$. Thus $w$ is periodic with period $n - m$. □ 

Remark. Actually, this proves something stronger: Let $w$ be any aperiodic word over an alphabet $\Sigma$, 
$|\Sigma| = k$, let $u$ be any factor of $w$ containing $k$ distinct letters, and let $vu$ be any factor of $w$ distinct 
from $u$ beginning in $u$. Then the abelian class of $v$ must have at least $k$ abelian returns. It follows that if a 
word is not periodic, then for every positive integer $N$ there exists an abelian factor of length $> N$ having 
at least $k + 1$ abelian returns. In other words, the value $k + 1$ must be assumed infinitely often. 

Remark. Notice that the condition given by Lemma 1 is not necessary for periodicity. It is not difficult 
to construct a periodic word such that some of its factors have more than $k$ abelian returns. 

Notice also that a characterization of periodicity similar to Proposition 1 in terms of abelian returns 
does not exist. Moreover, in the case of abelian returns it fails in both directions. Consider an 
infinite aperiodic word of the form $\{110010, 110100\}^{\omega}$. It is easy to see that the factor 11 has one abelian 
return $110010 \approx_{ab} 110100$. So, the existence of a factor having one abelian return does not guarantee 
periodicity. The converse is not true either: there exists a periodic word such that each of its factors has 
at least two abelian returns. An example is given by the following word with period 24: 

$$w = (001101001011001100110011)^{\omega}.$$

To check that every factor of this word has at least two abelian returns, one can check the factors up to 
length 12. If we denote the period word of $w$ by $u$, then every factor $v$ of length $12 < l < 24$ has the same 
abelian returns as the abelian class of words of length $24 - l$ obtained from $u$ by deleting $v$. For a factor of 
length greater than 24, its abelian returns coincide with the abelian returns of the part of this factor obtained by 
shortening it by $u$. 
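
Both examples can be explored numerically. The sketch below makes two assumptions that are not in the text: the aperiodic word over the blocks 110010 and 110100 is built by letting the Thue-Morse word choose the blocks (any aperiodic selector would do), and the periodic word is approximated by six copies of its period. The helper names are hypothetical.

```python
def thue_morse(n):
    """Prefix of length n of the Thue-Morse word, used here only as a block selector."""
    return "".join(str(bin(i).count("1") % 2) for i in range(n))


def parikh(u):
    """Letter counts (number of 0s, number of 1s)."""
    return (u.count("0"), u.count("1"))


def abelian_returns(w, u):
    """Map each abelian class of returns (a Parikh vector) to the return words seen in w."""
    n, target = len(u), parikh(u)
    occ = [i for i in range(len(w) - n + 1) if parikh(w[i:i + n]) == target]
    classes = {}
    for a, b in zip(occ, occ[1:]):
        classes.setdefault(parikh(w[a:b]), set()).add(w[a:b])
    return classes


# Aperiodic concatenation of the two blocks (selector is an assumption of this sketch).
blocks = ("110010", "110100")
v = "".join(blocks[int(c)] for c in thue_morse(64))
returns_11 = abelian_returns(v, "11")

# Finite stand-in for the periodic word with period 24.
period = "001101001011001100110011"
w = period * 6
# Smallest number of abelian returns over all factors of length 1..12.
min_count = min(
    len(abelian_returns(w, w[i:i + n]))
    for n in range(1, 13)
    for i in range(len(period))
)
print(returns_11, min_count)
```

The factor 11 yields a single abelian class containing both blocks. For the periodic word, the printed minimum can be compared with the claim above that every factor has at least two abelian returns.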

4 Characterization of Sturmian words 

The main result of this paper is the following characterization of Sturmian words: 

Theorem 2. An aperiodic recurrent infinite word is Sturmian if and only if each of its factors has two or 
three abelian returns. 

We prove this theorem in the following two sections. The necessity of the condition on the number of 
abelian returns is proved in Section 5, Proposition 3; the sufficiency is proved in Section 6, Proposition 5. 
Due to space limitations, we give only a sketch of the proof, omitting some of the details. We also 
establish some properties of abelian returns of Sturmian words, e.g., we show that a factor of a Sturmian 
word has two abelian returns if and only if it is singular (Section 5, Theorem 4). 

5 The structure of abelian returns of Sturmian words 

In this section we prove the "only if" part of Theorem 2 and in addition we establish some properties 
concerning the structure of abelian returns of Sturmian words. 

To describe the abelian returns for Sturmian words, we need to recall some notation. A factor $u$ of 
an infinite word $w$ is called right special (left special) if $ua$, $ub$ ($au$, $bu$) are factors of $w$ for two distinct 
letters $a$, $b$. For a Sturmian word there exists exactly one right special factor of each length. Note 
also that the set of factors of a Sturmian word is closed under reversal. A factor is bispecial if it is 
both right and left special. A factor of a Sturmian word is called singular if it is the only factor in its abelian 
class. Notice that singular factors have the form $aBa$, where $a$ is a letter and $B$ is a bispecial factor. The 
following proposition follows directly from definitions and basic properties of Sturmian words: 

Proposition 2. Abelian returns of factors of a Sturmian word are either letters or of the form $aBb$, where 
$a \ne b$ are letters, and $B$ is a bispecial factor. 

Proof. Consider an abelian return to a factor $v$ of length $n$ starting at position $i$. If $w_i = w_{i+n}$, then the 
letter $w_i$ is an abelian return. If $w_i = a$, $w_{i+n} = b$, $a \ne b$, then there exists $k \ge 0$ such that 
$w_{i+1} \cdots w_{i+k} = w_{i+1+n} \cdots w_{i+k+n}$ and $w_{i+k+1} \ne w_{i+k+1+n}$. Since $w$ is balanced, we have that 
$w_{i+k+1} = b$, $w_{i+k+1+n} = a$. So, $w_{i+k+2} \cdots w_{i+k+n+1} \approx_{ab} v$, and $w_i \cdots w_{i+k+1}$ is an abelian return to $v$. 
By definition the factor $w_{i+1} \cdots w_{i+k} = w_{i+1+n} \cdots w_{i+k+n}$ is bispecial. □ 

Corollary 1. In the case of Sturmian words, for each length $l \ge 2$ there exists at most one abelian return 
of length $l$. 

Now we proceed to the "only if" part of Theorem 2. 

Proposition 3. Each factor of a Sturmian word has two or three abelian returns. 
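
Before the proof, Proposition 3 and Corollary 1 can be checked experimentally. This is a sketch only: the Fibonacci word stands in for a general Sturmian word, factors are limited to length 8, and a finite prefix replaces the infinite word (these choices and the helper names are assumptions of the example).

```python
def fibonacci_word(n):
    """Prefix of length n of the Fibonacci word (fixed point of 0 -> 01, 1 -> 0)."""
    w = "0"
    while len(w) < n:
        w = "".join("01" if c == "0" else "0" for c in w)
    return w[:n]


def parikh(u):
    """Letter counts (number of 0s, number of 1s)."""
    return (u.count("0"), u.count("1"))


def abelian_return_classes(w, u):
    """Set of abelian classes (Parikh vectors) of the abelian returns of u seen in w."""
    n, target = len(u), parikh(u)
    occ = [i for i in range(len(w) - n + 1) if parikh(w[i:i + n]) == target]
    return {parikh(w[a:b]) for a, b in zip(occ, occ[1:])}


fib = fibonacci_word(3000)
counts = set()        # numbers of abelian returns observed over all tested factors
all_classes = set()   # every abelian class of returns observed
for n in range(1, 9):
    for u in {fib[i:i + n] for i in range(len(fib) - n + 1)}:
        classes = abelian_return_classes(fib, u)
        counts.add(len(classes))
        all_classes |= classes

# Group the observed return classes by the length of the return.
by_length = {}
for c in all_classes:
    by_length.setdefault(sum(c), set()).add(c)
print(counts, {length: len(cs) for length, cs in sorted(by_length.items())})
```

Only the counts 2 and 3 occur, and for every length at least 2 a single abelian class of returns is observed, as Corollary 1 predicts.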

The proof of this proposition is based on the characterization of balanced words presented in [2]. We 
will need some notation from that paper. 

Suppose $1 \le p < q$ are positive integers such that $\gcd(p,q) = 1$. Let $W_{p,q}$ denote the set of all 
words $w \in \{0,1\}^q$ with $|w|_1 = p$. If $w \in W_{p,q}$ then the symbol 1 occurs with frequency $p/q$ in $w$. 
Define the shift $\sigma : \{0,1\}^{\omega} \to \{0,1\}^{\omega}$ by $\sigma(w)_i = w_{i+1}$. Similarly define $\sigma : \{0,1\}^q \to \{0,1\}^q$ by 

$$\sigma(w_0 \cdots w_{q-1}) = w_1 \cdots w_{q-1} w_0.$$

Since $\gcd(p,q) = 1$, any element of $W_{p,q}$ has least period $q$ under the shift map $\sigma$. We will 
write $w \sim w'$ if there exists $0 \le k \le q-1$ such that $w' = \sigma^k(w)$. In this case we say that $w$, $w'$ are cyclically 
conjugate, or that $w$, $w'$ are cyclic shifts of one another. The equivalence class $\{\sigma^i(w) : 0 \le i < q\}$ of 
each $w \in W_{p,q}$ contains exactly $q$ elements. Let 

$$\mathcal{W}_{p,q} = W_{p,q} / \sim$$

denote the corresponding quotient. Elements of $\mathcal{W}_{p,q}$ are called orbits. It will usually be convenient to 
denote an equivalence class in $\mathcal{W}_{p,q}$ by one of its elements $w$. 

Given an orbit $[w] \in \mathcal{W}_{p,q}$, let 

$$w_{(0)} <_L w_{(1)} <_L \cdots <_L w_{(q-1)}$$

denote the lexicographic ordering of its elements. Define the lexicographic array $A[w]$ of the orbit $[w]$ 
to be the $q \times q$ matrix whose $i$th row is $w_{(i)}$. We will index this array by $0 \le i, j \le q-1$, so that 
$A[w] = (A[w]_{ij})_{i,j=0}^{q-1}$. For $0 \le i, j \le q-1$, let $w_{(i)}[j]$ denote the length-$(j+1)$ prefix of $w_{(i)}$; so the $w_{(i)}[j]$ 
are the length-$(j+1)$ factors of $w$, counted with multiplicity. For each $j$ this induces the following 
lexicographic ordering: 

$$w_{(0)}[j] \le_L w_{(1)}[j] \le_L \cdots \le_L w_{(q-1)}[j].$$

Theorem 3. [2] Suppose $w \in \{0,1\}^q$. The following are equivalent: 

(1) $w$ is a balanced word, 

(2) $|w_{(i)}[j]|_1 \le |w_{(i+1)}[j]|_1$ for all $0 \le i \le q-2$ and $0 \le j \le q-1$. 

The following proposition from [2] gives a very practical way of writing down the lexicographic 
array associated to a balanced word. 

Proposition 4. [2] Let $[w]$ be the unique balanced orbit in $\mathcal{W}_{p,q}$. Define $u \in W_{p,q}$ by 

$$u = \underbrace{0 \cdots 0}_{q-p} \underbrace{1 \cdots 1}_{p}.$$

Then, for $0 \le i, j \le q-1$, 

(1) $A[w]_{ij} = (\sigma^{jp} u)_i$, 

(2) the $j$th column of $A[w]$ is (the vector transpose of) the word $\sigma^{jp} u$, 

(3) $w_{(i)} = u_i (\sigma^{p} u)_i (\sigma^{2p} u)_i \cdots (\sigma^{(q-1)p} u)_i$. 

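Example. Consider the balanced word $w = 0101001 \in W_{p,q}$ (here $p = 3$, $q = 7$). The lexicographic ordering of $[w]$ is 

$$0010101 <_L 0100101 <_L 0101001 <_L 0101010 <_L 1001010 <_L 1010010 <_L 1010100,$$

so the corresponding lexicographic array is 

$$A[w] = \begin{pmatrix}
0 & 0 & 1 & 0 & 1 & 0 & 1 \\
0 & 1 & 0 & 0 & 1 & 0 & 1 \\
0 & 1 & 0 & 1 & 0 & 0 & 1 \\
0 & 1 & 0 & 1 & 0 & 1 & 0 \\
1 & 0 & 0 & 1 & 0 & 1 & 0 \\
1 & 0 & 1 & 0 & 0 & 1 & 0 \\
1 & 0 & 1 & 0 & 1 & 0 & 0
\end{pmatrix}.$$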
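
The lexicographic array of this example is easy to build by sorting cyclic shifts. The sketch below also tests condition (2) of Theorem 3 and the column description of Proposition 4 on this one word; the function names are hypothetical and the code is an illustration, not the construction used in [2].

```python
def shift(x, k):
    """Cyclic shift sigma^k: the word x read starting from position k (mod |x|)."""
    k %= len(x)
    return x[k:] + x[:k]


def lexicographic_array(w):
    """Rows are the cyclic shifts of w in lexicographic order."""
    return sorted(shift(w, i) for i in range(len(w)))


w = "0101001"
q, p = len(w), w.count("1")
A = lexicographic_array(w)

# Proposition 4: column j should equal sigma^{jp} applied to u = 0^(q-p) 1^p.
u = "0" * (q - p) + "1" * p
columns = ["".join(row[j] for row in A) for j in range(q)]
columns_ok = all(columns[j] == shift(u, j * p) for j in range(q))

# Theorem 3 (2): the number of 1s in the length-(j+1) prefixes is non-decreasing down the rows.
monotone_ok = all(
    A[i][:j + 1].count("1") <= A[i + 1][:j + 1].count("1")
    for i in range(q - 1)
    for j in range(q)
)

for row in A:
    print(" ".join(row))
print(columns_ok, monotone_ok)
```

The printed rows reproduce the array of the example, and both checks succeed for this word.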



We now apply the above technique for studying abelian returns as follows: 

Fix a Sturmian word $s$ and a factor $v$ of length $j + 1$. We consider a standard factor $w$ (see, e.g., [1]) of $s$ of long 
enough length to contain $v$ and all abelian returns to $v$. Let $|w| = q$, $|w|_1 = p$. Then all the conjugates of 
$w$ are factors of $s$, they are pairwise distinct, and $\gcd(p,q) = 1$. To be definite, we assume 
that $v$ is "poor" in 1s, i.e., it contains fewer 1s than the unique other abelian class of the same length. 
Then if we consider in $A[w]$ the words $w_{(i)}[j]$, there exists $n < q-1$ such that $w_{(i)}[j] \approx_{ab} v$ 
for $0 \le i \le n$, and $w_{(i)}[j] \not\approx_{ab} v$ for $n < i \le q-1$. Note also that $A[w]_{im} = A[w]_{(i+q-p)(m+1)}$; from now on 
the indices are taken modulo $q$. 

The lexicographic array allows us to find abelian returns to $v$ in the following way. For a word $u$ denote 
by $u[m, l]$ the factor $u_m \cdots u_l$. If for some $i$, $0 \le i \le n$, we have $w_{(i)}[k, k+j] \approx_{ab} v$ and $k \ge 1$ is minimal 
with this property, then $w_{(i)}[k-1]$ is an abelian return to $v$. Notice also that if $A[w]_{(i-1)k} = 1$ and $A[w]_{ik} = 0$, then 
$w_{(m)}[k, k+j] \approx_{ab} v$ for $m = i, \ldots, i+n$. That is, we have exactly $n+1$ words from the abelian class of $v$ 
starting in every column, and these words are in $n+1$ consecutive rows (the first and the last row are 
considered as consecutive). 

Example. Consider abelian returns to the abelian class of 001 in the example above. We have $w_{(i)}[2] \approx_{ab} 001$ for 
$0 \le i \le 4$; $w_{(i)}[1,3] \approx_{ab} 001$ for $i = 4, 5, 6, 0, 1$; $w_{(i)}[2,4] \approx_{ab} 001$ for $i = 1, \ldots, 5$. So, the abelian returns 
are $w_{(0)}[0] = w_{(1)}[0] = 0$, $w_{(4)}[0] = 1$, $w_{(2)}[1] = w_{(3)}[1] = 01$. 
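
The returns found in this example can be cross-checked by reading the word 0101001 cyclically, which mimics looking at all rows of the array at once. This is a sketch under the assumption that the cyclic word is an adequate stand-in for the standard factor here; `cyclic_abelian_returns` is a hypothetical helper.

```python
def cyclic_abelian_returns(w, u):
    """Abelian returns to the class of u when the binary word w is read cyclically."""
    q, n = len(w), len(u)
    target = u.count("1")
    doubled = w + w  # lets windows and returns wrap around the end of w
    # Starting positions (mod q) of windows abelian equivalent to u.
    occ = [i for i in range(q) if doubled[i:i + n].count("1") == target]
    returns = set()
    for a, b in zip(occ, occ[1:] + [occ[0] + q]):
        returns.add(doubled[a:b])
    return returns


returns = cyclic_abelian_returns("0101001", "001")
print(sorted(returns))
```

The result is the set of words 0, 01 and 1, matching the three abelian returns read off from the array.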

Proof of Proposition 3. Suppose that some factor $v$ of length $j + 1$ has 4 abelian returns; to be definite, 
let this factor be poor in 1s, and in the lexicographic array let rows $0, \ldots, n$ start with factors from the abelian 
class of $v$. By Corollary 1 there can be at most one abelian return of a fixed length greater than 1 (length 
1 will be considered separately), so in the lexicographic array we have one of the following situations: 

1) there exist $k_1 < k_2$ and $n_1 < n_2 < n$ such that $w_{(i)}[j]$ has abelian returns of length $k_1$ for $i = 1, \ldots, n_1$, 
$w_{(i)}[j]$ has abelian returns of length $k_2$ for $i = n_1 + 1, \ldots, n_2$, and $w_{(n_2+1)}[j]$ has abelian returns of length 
greater than $k_2$, 

2) symmetric case: there exist $k_1 < k_2$ and $n_1 < n_2 < n$ such that $w_{(i)}[j]$ has abelian returns of length $k_2$ 
for $i = n_1 + 1, \ldots, n_2$, $w_{(i)}[j]$ has abelian returns of length $k_1$ for $i = n_2 + 1, \ldots, n$, and $w_{(n_1)}[j]$ has abelian 
returns of length greater than $k_2$. 

We consider case 1) (for case 2) the proof is similar). First, in case 1) one can notice that the words 
$w_{(n_1)}[k_1, k_1 + q]$ and $w_{(n_2)}[k_2, k_2 + q]$ coincide. So if we consider abelian returns "to the left" of the words 
$w_{(n_1)}[k_1, k_1 + j]$ and $w_{(n_2)}[k_2, k_2 + j]$, they should be the same, but they are not: the first one is of length $k_1$, 
the second one is of length $k_2$. 

It remains to consider the case when $v$ has both letters as abelian returns. It can be seen directly from 
the lexicographic array that the third and last return is 01 (in this case, after a word not from the abelian 
class of $v$ we will necessarily have a word from the abelian class of $v$, i.e., the longest possible length of an 
abelian return is 2). □ 

Theorem 4. A factor of a Sturmian word has two abelian returns if and only if it is singular. 
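
Theorem 4 can also be tested on a sample word before reading the proof. The sketch below assumes the Fibonacci word as the Sturmian word, restricts to factors of length at most 8, and works on a finite prefix; all names are illustrative.

```python
def fibonacci_word(n):
    """Prefix of length n of the Fibonacci word (fixed point of 0 -> 01, 1 -> 0)."""
    w = "0"
    while len(w) < n:
        w = "".join("01" if c == "0" else "0" for c in w)
    return w[:n]


def parikh(u):
    """Letter counts (number of 0s, number of 1s)."""
    return (u.count("0"), u.count("1"))


def abelian_return_classes(w, u):
    """Set of abelian classes (Parikh vectors) of the abelian returns of u seen in w."""
    n, target = len(u), parikh(u)
    occ = [i for i in range(len(w) - n + 1) if parikh(w[i:i + n]) == target]
    return {parikh(w[a:b]) for a, b in zip(occ, occ[1:])}


fib = fibonacci_word(3000)
mismatches = []   # factors where "singular" and "two abelian returns" disagree
singular_factors = []
for n in range(1, 9):
    facs = {fib[i:i + n] for i in range(len(fib) - n + 1)}
    for u in sorted(facs):
        # Singular: u is the only factor of its length with its Parikh vector.
        singular = sum(parikh(x) == parikh(u) for x in facs) == 1
        two_returns = len(abelian_return_classes(fib, u)) == 2
        if singular:
            singular_factors.append(u)
        if singular != two_returns:
            mismatches.append(u)
print(singular_factors, mismatches)
```

On this sample the list of mismatches is empty: the factors with exactly two abelian returns are exactly the singular ones.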

Proof. The method of the proof is similar to the proof of Proposition 3 and relies upon the characterization 
of balanced words from [2]. 

If a factor is singular, then it is the only word in its abelian class, so its abelian returns coincide with 
usual returns. Since every factor of a Sturmian word has two returns [6], a singular factor has two 
abelian returns. 

Now we will prove the converse, i.e., that if a factor $v$, $|v| = j + 1$, of a Sturmian word $s$ has two 
abelian returns, then it is singular. 

As in the proof of Proposition 3, we consider a standard factor $w$ of $s$ of long enough length to contain 
$v$ and all abelian returns to $v$, and denote $|w| = q$, $|w|_1 = p$. Without loss of generality we again assume 
that $v$ is "poor" in 1s, so that there exists $n < q-1$ such that $w_{(i)}[j] \approx_{ab} v$ for $0 \le i \le n$, and $w_{(i)}[j] \not\approx_{ab} v$ 
for $n < i \le q-1$. 

It is not difficult to see that two abelian returns are possible in one of the following cases: 

Case 1) there exist $0 \le m < n$, $0 < k_1, k_2 < q$ such that $w_{(i)}[k_1 - 1]$ is an abelian return for all $0 \le i \le m$, and 
$w_{(i)}[k_2 - 1]$ is an abelian return for all $m + 1 \le i \le n$. 

Case 2) there exist $0 \le m_1 < m_2 < n$, $0 < k_1 < k_2 < q$ such that $w_{(i)}[k_1 - 1]$ is an abelian return for all 
$0 \le i \le m_1$ and $m_2 + 1 \le i \le n$; $w_{(i)}[k_2 - 1]$ is an abelian return for all $m_1 + 1 \le i \le m_2$. 

Case 1) In case 1) we will assume that $k_1 < k_2$; the proof in the case $k_2 < k_1$ is symmetric. We will consider 
two subcases: 

Case 1a) $A[w]_{m k_2} = 1$, $A[w]_{(m+1) k_2} = 0$. This means that $w_{(i)}[k_2, k_2 + j] \approx_{ab} v$ for $i = m + 1, \ldots, m + n + 1$, 
and $A[w]_{m(k_2-1)} = 0$, $A[w]_{(m+1)(k_2-1)} = 1$. So, the element $A[w]_{(m+1)k_2}$ is the left-upper element of a block 
of the abelian class of $v$, and $A[w]_{m(k_2-1)}$ is the right-lower element of another such block. It is easy to see that 
the latter block starts in column $k_1$. Therefore, $|v| = j + 1 = k_2 - k_1 < k_2$. 

In case 1a) we will prove that the abelian class of $v$ consists of a single word, i.e., $w_{(i)}[j] = v$ for 
$i = 0, \ldots, n$. Suppose that $w_{(i)}[j] \ne w_{(i+1)}[j]$ for some $i \in \{0, \ldots, n - 1\}$. Since the rows grow 
lexicographically, it means that there exists $0 \le l \le j < k_2 - 1$ such that $A[w]_{il} = 0$, $A[w]_{(i+1)l} = 1$. Hence 
$A[w]_{i(l+1)} = 1$, $A[w]_{(i+1)(l+1)} = 0$, and so $w_{(i+1)}[l + 1, l + 1 + j] \approx_{ab} v$. If $m < i + 1 \le n$, then the word 
$w_{(i+1)}[j]$ has return $w_{(i+1)}[l]$, which is impossible, because it has return $w_{(i+1)}[k_2 - 1]$. Similarly we get that the 
case $0 < i + 1 \le m$ and $l + 1 < k_1$ is impossible. 

In the case $0 < i + 1 \le m$ and $k_1 \le l + 1 < k_2$ we get that the word $w_{(i+1)}[k_1, k_1 + j]$ has return $w_{(i+1)}[k_1, l]$ 
of length $l - k_1 + 1$. But in this case $w_{(t)}[l + 1, l + 1 + j] \approx_{ab} v$ for $t = i + 1, \ldots, i + 1 + n$. This contradicts 
the condition that $w_{(t)}[k_2 - 1]$ is an abelian return to $w_{(t)}[j]$. So, the case $0 < i + 1 \le m$ and $k_1 \le l + 1 < k_2$ 
is impossible. Hence $w_{(i)}[j] = w_{(i+1)}[j]$ for $i = 0, \ldots, n - 1$, i.e., the abelian class of $v$ consists of a single 
word. 

Case 1b) $A[w]_{m k_2} = 0$ or $A[w]_{(m+1)k_2} = 1$. This means that $w_{(m)}[k_2, k_2 + j] \approx_{ab} v$. Hence the word $w_{(n)}[j]$ 
has abelian return $w_{(n)}[k_2]$ of length $k_2 + 1$, and the word $w_{(m)}[k_1, k_1 + j]$ has abelian return $w_{(m)}[k_1, k_2]$ 
of length $k_2 - k_1 + 1$, so the returns are different. This is impossible since $w_{(n)} = w_{(m)}[k_1, k_1 + q - 1]$. 

Case 2) In case 2) the fact that $w_{(i)}[k_1 - 1]$ is an abelian return for all $0 \le i \le m_1$ and $m_2 + 1 \le i \le n$ implies 
that $n > q/2$. So, $k_1 = 1$, i.e., we necessarily have return(s) of length 1. Since there are two abelian 
returns in total, we can have only one return of length 1, and this return is 0. It means that $A[w]_{i0} = 0$ for 
$0 \le i \le n$. Since $w_{(m_2)}[1, j + 1] \not\approx_{ab} v$ and $w_{(m_2+1)}[1, j + 1] \approx_{ab} v$, we have $A[w]_{m_2 1} = 1$, $A[w]_{(m_2+1)1} = 0$, 
and hence $A[w]_{m_2 0} = 0$, $A[w]_{(m_2+1)0} = 1$. We get a contradiction with $A[w]_{i0} = 0$ for $0 \le i \le n$. 

So, the converse is proved, i.e., every factor of a Sturmian word having two abelian returns is singular. □ 

6 Proof of Theorem 2: the sufficiency 

Here we prove the "if" part of Theorem 2, i.e., we establish the condition on the number of abelian 
returns forcing a word to be Sturmian: 

Proposition 5. If each factor of an aperiodic recurrent infinite word has two or three abelian returns, 
then the word is Sturmian. 

The proof of this proposition is rather technical; it is based on considering abelian returns to different 
possible factors of the infinite word and successively restricting the form of the word. Denote the 
aperiodic word with 2 or 3 abelian returns by $w$. First we notice that Lemma 1 implies that an aperiodic 
word with 2 or 3 abelian returns must be binary; we denote the letters by 0 and 1: $w \in \{0,1\}^{\omega}$. In the rest 
of this section, instead of abelian returns "to the left" we consider abelian returns "to the right": if $vu$ is a 
factor having $v' \approx_{ab} v$ as its suffix, and $vu$ does not contain as factors other words abelian equivalent to 
$v$ besides the suffix and the prefix, then $u$ is an abelian return to $v$. It is easy to see that, whichever definition 
is used, the set of abelian returns to each abelian factor is the same. Though this does not make any essential 
difference, this modification of the definition is more convenient for our proof of this proposition. 

We say that a letter $a$ is isolated in a word $w \in \Sigma^{\omega}$ if $aa$ is not a factor of $w$. We will make use of the 
following key lemma: 

Lemma 2. If each factor of an aperiodic recurrent infinite word w has at most three abelian returns, 
then one of the letters is isolated. 

Sketch of proof. In the proof of this lemma we will use the following definition. We say that a letter 
$a \in \Sigma$ appears in $w$ in a series of length $k > 0$ if a word $b a^k c$ is a factor of $w$ for some letters $b \ne a$, 
$c \ne a$. Considering abelian returns to letters, we get that every letter can appear in series of at most three 
different lengths. Denote these lengths for series of 0s by $l_1, l_2, l_3$, where $l_1 < l_2 < l_3$, and for series of 1s 
by $j_1, j_2, j_3$, where $j_1 < j_2 < j_3$. Notice that a letter can appear in series of only two or one lengths; then 
the third length, or the third and the second lengths, are missing. 

Consider abelian returns of the word $1 0^{l_1}$: they are 1, $0^{l - l_1} 1$ for $l = l_2, l_3$ (if 0 appears in series of 
the corresponding lengths), $1^{j-1} 0^{l_1}$ for $j = j_1 > 1, j_2, j_3$ (if 1 appears in series of the corresponding lengths), and 
0 for $j_1 = 1$. Some of these returns should be missing or abelian equivalent to others in order to have at 
most three abelian returns in total. So we have the following cases: 

- $j_2, j_3, l_3$ are missing, i.e., $w \in \{0^{l_1} 1^{j_1}, 0^{l_2} 1^{j_1}\}^{\omega}$; in this case the abelian returns are 1, $0^{l_2 - l_1} 1$, and $1^{j_1 - 1} 0^{l_1}$ 
for $j_1 > 1$ or 0 for $j_1 = 1$. 

- $l_2, l_3, j_3$ are missing, i.e., $w \in \{0^{l_1} 1^{j_1}, 0^{l_1} 1^{j_2}\}^{\omega}$. The abelian returns are 1, $1^{j_2 - 1} 0^{l_1}$, and $1^{j_1 - 1} 0^{l_1}$ if $j_1 > 1$, 
or 0 if $j_1 = 1$. 

- $j_2, j_3$ are missing, $j_1 = 2$, $l_2 = 2 l_1$ or $l_3 = 2 l_1$, i.e., $w \in (\{0^{l_1}, 0^{2 l_1}, 0^{l}\} 1^{2})^{\omega}$. The abelian returns are 1, $0^{l_1} 1$, 
$0^{l - l_1} 1$. 

- $l_3, j_3$ are missing, $l_2 = 2 l_1$, $j_1 = 2$ or $j_2 = 2$, $w \in (\{0^{l_1}, 0^{2 l_1}\}\{1^{2}, 1^{j}\})^{\omega}$. The abelian returns are 1, $0^{l_1} 1$, 
$1^{j-1} 0^{l_1}$ (if $j > 1$) or 0 (if $j = 1$). 

Notice that the first two cases are symmetric. Considering abelian returns to the word $1^{j_1} 0$, we get 
symmetric cases (0 changes places with 1, $l_k$ changes places with $j_k$, $k = 1, 2, 3$). Combining the cases 
obtained by considering abelian returns to $1 0^{l_1}$ with the cases obtained by considering abelian returns to 
$1^{j_1} 0$, we finally get the following remaining cases (up to renaming letters): 

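1) $j_2, j_3, l_3$ are missing, i.e., $w$ is of the form $w \in \{0^{l_1} 1^{j_1}, 0^{l_2} 1^{j_1}\}^{\omega}$. 

2) $l_3, j_3$ are missing, $l_1 = 1$, $l_2 = 2$, $j_1 = 2$, $j_2 = 4$, i.e., $w \in (\{0, 0^2\}\{1^2, 1^4\})^{\omega}$. 

3) $l_3, j_3$ are missing, $l_1 = 1$, $l_2 = 2$, $j_1 = 1$, $j_2 = 2$, i.e., $w \in (\{0, 0^2\}\{1, 1^2\})^{\omega}$. 

4) $l_3, j_3$ are missing, $l_1 = 2$, $l_2 = 4$, $j_1 = 2$, $j_2 = 4$, i.e., $w \in (\{0^2, 0^4\}\{1^2, 1^4\})^{\omega}$. 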

Case 1): $w \in \{0^{l_1} 1^{j_1}, 0^{l_2} 1^{j_1}\}^{\omega}$. 

In the first case we should prove that $j_1 = 1$. We omit the index 1 for brevity: $j = j_1$. Suppose that $j > 1$. 
Consider abelian returns to the word $1 0^{l_2}$. They are 1 and $1^{j-1} (0^{l_1} 1^{j})^{k} 0^{l_2}$ for all $k \ge 0$ such that the word 
$0^{l_2} 1^{j} (0^{l_1} 1^{j})^{k} 0^{l_2}$ is a factor of $w$. Therefore, we have at most two values of $k$ (possibly including 0). 

Abelian returns to the word $1^{j} 0^{l_1} 1$ are 1 and $(0^{l_2} 1^{j})^{m} 0^{l_1} 1$ for all $m \ge 0$ such that the word $1 0^{l_1} 1^{j} (0^{l_2} 1^{j})^{m} 0^{l_1} 1$ 
is a factor of $w$. So, we have at most two values of $m$ (possibly including 0). 

Taking into account the conditions on $m$ and $k$, which we have just obtained from considering abelian 
returns to both $1 0^{l_2}$ and $1^{j} 0^{l_1} 1$, we find that there are two possibilities for an aperiodic word $w$: 

Case 1a) $w \in (\{(0^{l_1} 1^{j})^{k_1}, (0^{l_1} 1^{j})^{k_2}\} 0^{l_2} 1^{j})^{\omega}$, $0 < k_1 < k_2$. The word $0^{l_2} 1^{j} 0^{l_1} 1^{j-1}$ has returns 1, $0^{l_1} 1$, and 
$0^{l_2} (1^{j} 0^{l_1})^{k-1} 1$ for all $k$ such that the word $0^{l_2} 1^{j} (0^{l_1} 1^{j})^{k} 0^{l_2}$ is a factor of $w$. To provide at most three 
abelian returns, $w$ should admit only one value of $k$. Hence, $w$ is periodic and case 1a) is impossible. 

Case 1b) $w \in (0^{l_1} 1^{j} \{(0^{l_2} 1^{j})^{m_1}, (0^{l_2} 1^{j})^{m_2}\})^{\omega}$, $0 < m_1 < m_2$. The word $1^{j-1} 0^{l_1} 1^{j} 0^{l_2}$ has returns 1, $1 0^{l_2}$, and 
$1 0^{l_1} (1^{j} 0^{l_2})^{m-1}$ for all $m$ such that the word $1 0^{l_1} 1^{j} (0^{l_2} 1^{j})^{m} 0^{l_1} 1$ is a factor of $w$. To provide at most three 
abelian returns, $w$ should admit only one value of $m$. Hence, $w$ is periodic and case 1b) is impossible. 

Thus, in case 1) the 1s are isolated. 

Cases 2)-4) In cases 2)-4) we need to consider words containing all four series, otherwise we get into 
the conditions of case 1), in which we proved that the 1s are isolated. The proof is similar for the three cases 
and is based on studying abelian returns of a certain type. When we examine $w \in (\{0^{l_1}, 0^{l_2}\}\{1^{j_1}, 1^{j_2}\})^{\omega}$, 
we consider abelian returns to the words $0^{l_1} 1^{j_2}$ and $0^{l_2} 1^{j_1}$, and with a technical case study obtain that 
if both words have at most three abelian returns, then $w$ is periodic. For brevity, we omit the details of the 
proof for cases 2)-4). □ 

Lemma 3. If $w \in \{0^{l_1} 1, 0^{l_2} 1\}^{\omega}$, $0 < l_1 < l_2$, is an aperiodic recurrent word and each of its factors has at 
most three abelian returns, then $l_2 = l_1 + 1$. 

Proof. Suppose that $l_2 > l_1 + 1$. Consider abelian returns to the word $0^{l_1 + 1}$: it has abelian returns 0 
and $1 (0^{l_1} 1)^{k} 0^{l_1 + 1}$ for all $k \ge 0$ such that $0^{l_2} 1 (0^{l_1} 1)^{k} 0^{l_2}$ is a factor of $w$; thus there could be at most two 
different values of $k$ (possibly including 0). Consider abelian returns to the word $1 0^{l_1} 1 0$: it has abelian 
returns 0 and $(0^{l_2 - 1} 1 0)^{j} 0^{l_1 - 1} 1$ for all $j \ge 0$ such that $1 0^{l_1} 1 (0^{l_2} 1)^{j} 0^{l_1} 1$ is a factor of $w$; thus there could be 
at most two different values of $j$ (possibly including 0). Since $w$ is non-periodic, we have two cases: 

Case I: $w \in (0^{l_2} 1 \{(0^{l_1} 1)^{k_1}, (0^{l_1} 1)^{k_2}\})^{\omega}$, $0 < k_1 < k_2$. In this case one can find four abelian returns to 
$0^{l_2} 1 0^{l_1 - 1}$: 0, $1 0^{l_1 - 1}$, $(1 0^{l_1})^{k_1 - 1} 1 0^{l_2 - 1}$, $(1 0^{l_1})^{k_2 - 1} 1 0^{l_2 - 1}$. 

Case II: $w \in (0^{l_1} 1 \{(0^{l_2} 1)^{j_1}, (0^{l_2} 1)^{j_2}\})^{\omega}$, $0 < j_1 < j_2$. In this case one can find four abelian returns to 
$1 0^{l_2} 1 0^{l_1} 1 0$: 0, $0^{l_2 - 1} 1$, $(0^{l_2 - 1} 1 0)^{j_1 - 1} 0^{l_1 - 1} 1$, $(0^{l_2 - 1} 1 0)^{j_2 - 1} 0^{l_1 - 1} 1$. □ 

The proofs of Lemma 2 and Lemma 3 imply 

Corollary 2. If each factor of an infinite aperiodic recurrent word $w$ has two or three abelian returns, 
then $w \in \{0^{l_1} 1, 0^{l_1 + 1} 1\}^{\omega}$. 
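
The block structure in Corollary 2 is easy to observe on a concrete Sturmian word. The sketch below assumes the Fibonacci word as the example and a finite prefix of it; for that word the letter 1 is the isolated one and the runs of 0s have lengths 1 and 2.

```python
def fibonacci_word(n):
    """Prefix of length n of the Fibonacci word (fixed point of 0 -> 01, 1 -> 0)."""
    w = "0"
    while len(w) < n:
        w = "".join("01" if c == "0" else "0" for c in w)
    return w[:n]


fib = fibonacci_word(2000)

# Cut the prefix at every 1: each piece is the run of 0s of one block 0^l 1.
# The last piece is dropped because its block may be cut off by the end of the prefix.
runs = fib.split("1")[:-1]
run_lengths = {len(r) for r in runs}

# The letter 1 is isolated exactly when 11 does not occur.
one_is_isolated = "11" not in fib
print(sorted(run_lengths), one_is_isolated)
```

The two run lengths differ by exactly one, so the prefix is a concatenation of the blocks 01 and 001.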

Lemma 4. If each factor of an aperiodic recurrent infinite word $w$ has at most three abelian returns, 
then $w$ is 2-balanced. 

Proof. For a length $n$, consider abelian classes of factors of length $n$ of such a word $w$. Denote by $A$ the 
abelian class of factors containing the smallest number of 1s: $A = \{u \in F_n(w) : |u|_1 = \min_{v \in F_n(w)} |v|_1\}$. 
The next class we denote by $B$: $B = \{u \in F_n(w) : |u|_1 = \min_{v \in F_n(w)} |v|_1 + 1\}$, and the next one by $C$. If $w$ has 
only two abelian classes, then it is Sturmian, so we are interested in the case when $w$ has at least three 
abelian classes. For a length $n$, we associate to the word $w$ a word $\xi^{(n)}$ over the alphabet of abelian classes 
of $w$ of length $n$ as follows: for an abelian class $M$ of words of length $n$, $\xi^{(n)}_k = M$ iff $w_k \cdots w_{k+n-1} \in M$. 

In other words, $(\xi^{(n)}_k)_{k \ge 0}$ is the sequence of abelian classes of consecutive factors of length $n$ in $w$. 

It is easy to see that $\xi^{(n)}$ contains the following sequence of classes: $C B^{j_1} A^{j_2} B$ for some $j_1, j_2 \ge 1$, 
i.e., for some $i$ we have $\xi^{(n)}_i \cdots \xi^{(n)}_{i+j_1+j_2+1} = C B^{j_1} A^{j_2} B$. Then we have 

$w_i = 1$, $w_{i+n} = 0$, 

$w_k = w_{k+n}$ for $k = i + 1, \ldots, i + j_1 - 1$, 

$w_{i+j_1} = 1$, $w_{i+j_1+n} = 0$, 

$w_k = w_{k+n}$ for $k = i + j_1 + 1, \ldots, i + j_1 + j_2 - 1$, 

$w_{i+j_1+j_2} = 0$, $w_{i+j_1+j_2+n} = 1$. 

That is, $w_i \cdots w_{i+j_1+j_2} = 1u1v0$ and $w_{i+n} \cdots w_{i+j_1+j_2+n} = 0u0v1$. 

By Corollary 2 we have $w \in \{0^{l_1} 1, 0^{l_1+1} 1\}^{\omega}$, so $|u| > 2 l_1 + 1$; $u$ contains both letters 0 and 1 and has 
a suffix $0^{l_1}$. It follows that $j_2 = 1$. So, the class $B$ has the following 3 abelian returns: 0, 1, 01. All the 
returns are of length at most 2, so if after an occurrence of $B$ we have $C$, then the next class is $B$ again, 
otherwise we would get a longer return. So there are no other classes than these. In addition, we proved 
that if for length $n$ there are three abelian classes, then in $\xi^{(n)}$ the letters $A$ and $C$ are isolated. □ 
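
The notions of 1-balanced and 2-balanced words used here can be made concrete with a short computation. In this sketch, `balance(w, n)` is a hypothetical helper returning the largest difference in the number of 1s between two factors of length n; the Fibonacci word serves as a Sturmian example and the Thue-Morse word as a 2-balanced example that is not 1-balanced. Finite prefixes are an assumption of the example.

```python
def fibonacci_word(n):
    """Prefix of length n of the Fibonacci word (fixed point of 0 -> 01, 1 -> 0)."""
    w = "0"
    while len(w) < n:
        w = "".join("01" if c == "0" else "0" for c in w)
    return w[:n]


def thue_morse(n):
    """Prefix of length n of the Thue-Morse word (parity of the binary digit sum)."""
    return "".join(str(bin(i).count("1") % 2) for i in range(n))


def balance(w, n):
    """Largest difference in the number of 1s between two factors of length n of w."""
    ones = [w[i:i + n].count("1") for i in range(len(w) - n + 1)]
    return max(ones) - min(ones)


fib = fibonacci_word(2000)
tm = thue_morse(2000)
# Worst-case imbalance over factor lengths 1..20.
fib_balance = max(balance(fib, n) for n in range(1, 21))
tm_balance = max(balance(tm, n) for n in range(1, 21))
print(fib_balance, tm_balance)
```

A value of 1 corresponds to two abelian classes per length (the classes A and B above), and a value of 2 to three classes A, B and C.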

Proof of Proposition 5. Due to Corollary 2 and Lemma 4, $w$ is 2-balanced and it is of the 
form $\{0^{l_1} 1, 0^{l_1+1} 1\}^{\omega}$ for some integer $l_1$. Suppose that $w$ is not 1-balanced. Then there exists $n$ for which 
there exist three classes of abelian equivalence in $F_n(w)$; as above, denote these classes by $A$, $B$ and $C$. 
Arguing as in the proof of Lemma 4, consider a sequence of classes $B C B^{j} A B$ which we necessarily have 
in $\xi^{(n)}$ for some integer $j$, and denote its starting position by $i - 1$. The corresponding factor in $w$ satisfies 

$w_{i-1} = 0$, $w_{i-1+n} = 1$, 

$w_i = 1$, $w_{i+n} = 0$, 

$w_k = w_{k+n}$ for $k = i + 1, \ldots, i + j - 1$, 

$w_{i+j} = 1$, $w_{i+j+n} = 0$, 

$w_{i+j+1} = 0$, $w_{i+j+1+n} = 1$. 

That is, $w_i \cdots w_{i+j+1} = 1u10$ and $w_{i+n} \cdots w_{i+j+1+n} = 0u01$. Remark that $u = w_{i+1} \cdots w_{i+j-1}$ has prefix $0^{l_1} 1 0$. 

Now consider abelian returns to the abelian class $B0 = A1$ of length $n + 1$. The factor starting from 
the position $i + 1$ is of the form $B0$, so it belongs to this class, and has an abelian return 0. The word 
starting from the position $i + j$ is of the form $B0$ and has an abelian return 1. The word starting from the 
position $i + l_1 - 1$ belongs to this class, and has an abelian return 01. So we have at least three returns 0, 
1 and 01. Now consider the occurrence of the class $B0 = A1$ to the left of the position $i + 1$. One can see 
that the positions $i$ and $i - 1$ are from the class $B1 = C0$, so the preceding occurrence of $B0 = A1$ has an 
abelian return of length greater than 2, which is a fourth return, though there should be at most three. So 
we cannot have more than two classes of abelian equivalence in an aperiodic word having two or three 
abelian returns, i.e., such a word should be 1-balanced and hence Sturmian. Proposition 5 is proved. □ 

Remark. Actually, in Proposition 5 instead of the recurrence property one can consider a weaker property 
of abelian recurrence, in the sense that for every factor $u$ of $w$ there exists a factor $u'$ from the abelian 
class of $u$ which occurs infinitely many times in $w$. 